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ABSTRACT 

This paper presents the clustering properties of hard (2-8 keV) X-ray se- 
lected sources detected in a wide field (~ 2deg 2 ) shallow [/x(2 — 8keV) ~ 
10~ 14 erg cm -2 s _1 ] and contiguous XMM-Newton survey. We perform an an- 
gular correlation function analysis using a total of 171 sources to the above flux 
limit. We detect a ~ 4ct correlation signal out to 300arcsec with w(9 < 300") ~ 
0.13 ± 0.03. Modeling the two point correlation function as a power law of the 
form w{9) = (Bo/ey- 1 we find: o = 48.9^1:5 arcsec and 7 = 2.2 ± 0.30. Fixing 
the correlation function slope to 7 = 1.8 we obtain 9 a = 22.2^8^ arcsec. Using 
Limber's intergral equation and a variety of possible luminosity functions of the 
hard X-ray population, we find a relatively large correlation length, ranging from 
r ~ 9 to 19 hr 1 Mpc (for 7 = 1.8 and the concordance cosmological model), 
with this range reflecting also different evolutionary models for the source lumi- 
nosities and clustering characteristics. The relatively large correlation length is 
comparable to that of extremely red objects and luminous radio sources. 

Subject headings: galaxies: active — quasars: general — surveys — cosmology: 
observations - - large-scale structure of the universe — X-rays: diffuse back- 
ground 



1. Introduction 



It is well known that the study of the distribution of matter on large scales, using 
different extragalactic objects provides important constraints on models of cosmic structure 
formation. Since Active Galactic Nuclei (AGN) can be detected up to very high redshifts 
they provide information on the underlying mass distribution as well as on the evolution of 
large scale structure (cf. Hartwick & Schade 1989; Basilakos 2001 and references therein). 
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The traditional indicator of clustering, the angular two-point correlation function, is a 
fundamental and simple statistical test for the study of any extragalactic mass tracer and 
is relatively straightforward to measure from observational data. The overall knowledge of 
the AGN clustering using X-ray data comes mostly from the soft (< 3keV) X-ray band 
(Boyle & Mo 1993; Vikhlinin & Forman 1995; Carrera et al. 1998; Akylas, Georgantopoulos, 
Plionis, 2000; Mullis 2002), which is however biased against absorbed AGNs. Hard X-ray 
surveys (> 2keV) play a key role in our understanding of how the whole AGN population, 
including obscured (type II) AGNs, trace the underlying mass distribution. Furthermore, 
understanding the spatial distribution of type II AGNs is important since they are among 
the main contributors of the cosmic X-ray background (Mushotzky et al. 2000; Hasinger et 
al. 2001; Giacconi et al. 2002). 

Recently, Yang et al. (2003) performing a counts-in cells analysis of a deep (f2~8kev ~ 
3 x 10~ 15 erg s" 1 cm" 2 ) Chandra survey in the Lockman Hole North- West region, found 
that the hard band sources are highly clustered with ~ 60% of them being distributed in 
overdense regions. The XMM-Newton with ~ 5 times more effective area, especially at hard 
energies, and ~ 3 times larger field of view (FOV) provides an ideal instrument for clustering 
studies of X-ray sources. 

In this paper we estimate for the first time the angular correlation function of the 
XMM-Newton hard X-ray sample. Using Limber's equation and different models of the 
luminosity funtion for these sources we derive the expected spatial correlation function which 
we compare with that of a variety of extragalactic populations. Hereafter, all independent 
quantities will be given in units of h = H o /100 km Mpc -1 . 

2. The Sample 

The hard X-ray selected sample used in the present study is compiled from the XMM- 
Newton/2dF survey. This is a shallow (2-10 ksec per pointing) survey carried out by the 
XMM-Newton near the North Galactic Pole [NGP; RA(J2000) = 13 h 41 m ; Dec.(J2000)=00°00'] 
and the South Galactic Pole [SGP; RA(J2000)=00 h 57 m , Dec.(J2000)=-28°00'] regions. A 
total of 18 XMM-Newton pointings were observed equally split between the NGP and the 
SGP areas. However, a number of pointings were discarded due to elevated particle back- 
ground at the time of the observation. This results in a total of 13 usable XMM-Newton 
pointings covering an area of m 2 deg 2 . A full description of the data reduction, source 
detection and flux estimation are presented by Georgakakis et al. (2003, 2004). For the 2-D 
correlation analysis presented in this paper we use the hard (2-8 keV) band catalogue of the 
XMM-iVei/;fon/2dF survey. This comprises a total of 171 sources above the 5<r detection 
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threshold to the limiting flux of f x (2 — 8keV) 10~ 14 ergs -1 cm -2 . Note that our hard 
X-ray sources comprise of a mixture of QSO's and relatively nearby (z < 0.8) galaxies with 
red colours g — r > 0.5 which are most probably associated with obscured low-luminosity 
AGN (Georgantopoulos et al. 2004). 

3. Correlation function analysis 

3.1. The angular correlation 

The clustering properties of the hard X-ray selected sources are estimated using the two 
point angular correlation function w{9) defined by dP = n 2 [l + w(9)]dQidQ\, where dP is 
the joint probability of finding two sources in the solid angle elements dfli and dQ 2 separated 
by angle 9 and n is the mean surface density of sources. For a random distribution of sources 
w(9)—0. Therefore, the angular correlation function provides a measure of galaxy density 
excess over that expected for a random distribution. A variety of estimators of w{9) have 
been used over the years (cf. Infante 1994). 

In the present study we use the estimator (cf. Efstathiou et al. 1991): 

w(9) = f(N DD /N DR ) - 1, (1) 

where N DD and N DR is the number of data-data and data-random pairs respectively at 
separations 9 and 9 + d9. In the above relation / is the normalization factor / = 2N R / (N^ — 
1) where and N R are the total number of data and random points respectively. The 
uncertainty in w(9) is estimated as a w = y/(l + w(9))/N DR (Peebles 1973). To account for 
the different source selection and edge effects, we have produced 100 Monte Carlo random 
realizations of the source distribution within the area of the survey by taking into account 
variations in sensitivity which might affect the correlation function estimate. Indeed, the 
flux threshold for detection depends on the off-axis angle from the center of each of the 
XMM- Newton pointings. Since the random catalogues must have the same selection effects 
as the real catalogue, sensitivity maps are used to discard random points in less sensitive 
areas (close to the edge of the pointings). This is accomplished, to the first approximation, 
by assigning a flux to the random points using the Baldi et al. (2002) 2-10 keV log N — log S 
(after transforming to the 2-8 keV band assuming T = 1.7). If the flux of a random point 
is less than 5 times the local rms noise (assuming Poisson statistics for the background) 
the point is excluded from the random data-set. We note that the Baldi et al. (2002) 
log N — log S is in good agreement with the 2-8 keV number counts estimated in the present 
survey. This is demonstrated in Figure 1 were we plot our differential number counts and 
the best fit relation of Baldi et al. (2002). Note that we have tested that our random 
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simulations reproduce both the off-axis sensitivity of the detector as well as the individual 
field log N- log S. 

We apply the correlation analysis evaluating w(9) in logarithmic intervals with 5\og9 = 
0.05. The results are shown in Figure 2, were the line corresponds to the best-fit power 
law model w{9) = (9 /9y / ~ 1 using the standard x 2 minimization procedure in which each 
correlation point is weighted by its error. We find a statistically significant signal with 
w(9 < 300 ) ~ 0.13 ± 0.03 at the 4.3a and ~ 2.7a confidence level using Poissonian or 
bootstrap errors respectively. Note that the bootstrap errors probably overestimate the true 
uncertainty, especially in sparse samples (Fisher et al 1994). Therefore the true significance 
level is somewhere in between the above two values. 

In the insert of Figure 2 we present the iso-A^ 2 contours (where A% 2 = \ 2 ~ Xmm) 
in the 7 — 9 Q plane. The contours correspond to la (A% 2 = 2.30) and la (A% 2 = 6.17) 
uncertainties, respectively. The best fit clustering parameters are: 

9 = 48.9l^;| arcsec 7 = 2.2 ± 0.30 (2) 

where the errors correspond to la (A% 2 = 2.30) uncertainties. Fixing the correlation function 
slope to its nominal value, 7 = 1.8, we estimate 9 Q = 22. 2jJ;g arcsec 3 . Note that our results 
do not suffer from the amplification bias which results from merging close source pairs when 
the PSF size is larger than their typical separation (see Vikhlinin & Forman 1995). This 
is because the estimated 9 Q values are much larger than the XMM-Newton PSF size of 6" 
FWHM. 

Another systematic effect that could bias the angular 2-point correlation function is 
that introduced by the so called integral constraint. This results from the fact that the 
correlation function is estimated from a limited area, which in turn implies that over the 
area studied the relation J J u>(# 12 )df2idf22 — should be satisfied. We can attempt to 
estimate the resulting underestimation of the true correlation function by calculating the 
quantity: W = J dQ\ J dQ2w{9i2)/ J dfii J d^2 Clearly, evaluating W necessitates a priori 
knowledge of the angular correlation function. A tentative value of W using a range of w(9) 
given by varying within la our results (eq.2) is: W ~ 0.02. By adding W to our estimated 
(raw) w(9) and fitting again the model correlation function we find: 

9 ~ 44 ± 20 arcsec 7 ~ 2 ± 0.25 , (3) 

consistent within the errors, with our uncorrected results (eq. 2). If we fix 7 = 1.8 we obtain 



3 The robustness of our results to the fitting procedure was tested using different bins (spanning from f 
to 20) and no significant difference was found. 
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9 a = 28 ± 9 arcsec. Due to the small effect of the integral constraint correction we will use 
in the rest of the paper our raw w{6) estimates. 

Our results show that hard X-ray sources are strongly correlated, even more than the 
soft ones (see Vikhlinin & Forman 1995; Yang et al. 2003; Basilakos et al. in preparation). 
Our derived angular correlation length o is in rough agreement, although somewhat smaller 
(within la) with the Chandra result of O — 40 ± 11 arcsec (Yang et al. 2003). The stronger 
angular clustering with respect to the soft sources could be either due to the higher flux limit 
of the hard XMM-Newton sample, resulting in the selection of relatively nearby sources, or 
could imply an association of our hard X-ray sources with high-density peaks of the underline 
matter distribution. To test the latter suggestion we have measured the cluster - hard X- 
ray sources cross-correlation function (w C; h a rd) using either the Goto et al. (2002) clusters, 
detected in the multicolour optical SDSS data, or the X-ray clusters that we have detected 
on our fields (Gaga et al. in preparation). We find no significant cross-correlation signal 
[u> c ,hard(@ < 300") ~ —0.15 ± 0.19], a fact that weakens the suggestion of association of the 
hard X-ray sources with high-density peaks. However, we must stress that the null result 
could be artificial, due to small number statistics. 



3.2. The spatial correlation length using w{6) 

The angular correlation function w{9) can be obtained from the spatial one, £(r), 
through the Limber transformation (Peebles 1980). If the spatial correlation function is 
modeled as 

£(r,z) = (r/r )-^l + z)-^ , (4) 

then for a flat Universe the amplitude 8 in two dimensions is related to the correlation 
length r (see Efstathiou et al. 1991) in three dimensions through the equation: 

where x(z) is the proper distance, E(z) = ^/£l m (l + z) 3 + is the element of comoving 
distance and if 7 = r(|)T( :i ^i)/T(^). Note that if e = 7 — 3, the clustering is constant 
in comoving coordinates (comoving clustering) while if e = — 3 the clustering is constant 
in physical coordinates. We perform the above inversion in the framework of either the 
concordance ACDM cosmological model (fi m — 1 — Q A — 0.3, H = 70km s _1 Mpc -1 ) or the 
Einstein-de Sitter model. 

The redshift distribution dN/dz and the predicted total number, N, of the X-ray sources 
which enter in eq. 5 can be found using the hard band luminosity functions of Ueda et al. 
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(2003) and of Boyle et al. (1998). We also use different models for the evolution of the 
hard X-ray sources: a pure luminosity evolution (PLE) or the more realistic luminosity 
dependent density evolution (LDDE; Ueda et al 2003). In Figure 3 we show the expected 
redshift distributions of the hard X-ray sources for three different luminosity functions and 
evolution models. Both the Boyle et al (1998) and Ueda et al (2003) luminosity functions 
with pure luminosity evolution give relatively similar dN/dz distributions. However, the 
LDDE model gives an dN/dz distribution shifted to much larger redshifts with a median 
redshift of z ~ 0.75 (see also Table 1). 

For the comoving clustering model (e = 7 — 3) and using the LDDE evolution model, 
we estimate the hard X-ray source correlation length to be: r Q = 19 ± 3 h^ 1 Mpc and 
r D = 13.5±3 h~ l Mpc for 7 = 1.8 and 7 = 2.2 respectively. While if e = — 3 the corresponding 
values are: r Q = 11.5 ± 2 hr l Mpc and r D = 6 ± 1.5 hr l Mpc, respectively. In Table 1, we 
present the values of the correlation length, r Q , resulting from Limber's inversion for different 
luminosity function and evolution models. 

These estimated clustering lengths (for 7 = 1.8) are a factor of > 2 larger than the 
corresponding values of the Lyman break galaxies (Adelberger 2000), the 2dF (Hawkins 
et al. 2003) and SDSS (Budavari et al. 2003) galaxy distributions as well as the 2QZ 
QSO's (Croom et al. 2001). However, the most luminous, and thus nearer, 2QZ sub-sample 
(18.25 < bj < 19.80) has a larger correlation length (~ 8.5 ± 1.7 h^ 1 Mpc) than the overall 
sample (Croom et al. 2002), in marginal agreement with our e = —3 clustering evolution 
results. 

The large spatial clustering length of our hard X-ray sources can be compared with that 
of Extremely Red Objects (EROs) and luminous radio sources (Roche, Dunlop & Almaini 
2003; Overzier et al. 2003; Rottgering et al. 2003) which are found to be in the range 
r D ~ 12 — 15 h^ 1 Mpc. The possible association of EROs with high-z massive ellipticals and 
of luminous radio sources with protoclusters (for a review see Rottgering et al. 2003 and 
references therein) suggests that our hard X-ray sources could trace the high peaks of the 
underline mass distribution (see also Yang et al. 2003). 

4. Conclusions 

In this paper we explore the clustering properties of hard (2-8 keV) X-ray selected sources 
using a wide area 2deg 2 ) shallow [/x(2 — 8keV) rs 10~ 14 erg cm -2 s _1 ] XMM- Newton 
survey. Using an angular correlation function analysis we measure a clustering signal at the 
~ 4(7 confidence level. Modeling the angular correlation function by a power-law, w{6) = 
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(6 /6)^\ we estimate 6 Q = 48.9l^J arcsec and 7 = 2.2 ± 0.30. Fixing the correlation 
function slope to 7 = 1.8 we estimate 9 a = 22.21^ arcsec. Using a variety of luminosity 
functions and evolutionary models the Limber's inversion provides correlation lengths which 
are in the range r Q ~ 10 — 19 h^ 1 Mpc, typically larger than those of galaxies and optically 
selected QSO's but similar to those of strongly clustered populations, like EROs and luminous 
radio sources. 
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the European Union and the Greek Government in the framework of the program 'Promotion 
of Excellence in Technological Development and Research', project 'X-ray Astrophysics with 
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Table 1: The hard X-ray sources correlation length (r D in h^ 1 Mpc) for different pairs of (7, e) 
and for the different luminosity functions and evolution models. The last column indicates 
the predicted median redshift, from the specific luminosity function used. The bold letters 
deliniate the prefered cosmological model and the most updated luminosity function. 



LF 


Evol. Model 


(f2 m , Q\) 


r (1.8,-1.2) 


r (1.8,-3) 


r Q (2.2,-0.8; 


l r Q (2.2, -3) 


z 


Boyle 


No evol. 


(1,0) 


11.5±2.00 


9.0±1.5 


7.3±1.2 


6.0±1.5 


0.45 


Ueda 


No evol. 


(1,0) 


9.5±1.5 


7.5±1.0 


6.5±1.5 


5.0±1 


0.40 


Boyle 


PLE 


(1,0) 


13.0 ±3.0 


10.0 ±2.0 


8.0±2.0 


6.8 ±1.5 


0.50 


Ueda 


PLE 


(0.3,0.7) 


13.0 ±2.0 


9.3 ±1.6 


9.0 ±2.0 


6.7 ± 1.5 


0.45 


Ueda 


LDDE 


(0.3-0.7) 


19.0 ±3.0 


11.5 ±2.0 


13.5 ±3 


8.5 ±2 


0.75 


Ueda 


LDDE 


(1,0) 


15.0 ±2.5 


7.8 ±1.5 


11.0±2.5 


6.0±1.5 


0.80 
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Fig. 1. — The hard band (2-8 keV) differential number counts from the present survey in 
comparison with those of Baldi et al. (2002; dashed line). 
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Fig. 2. — The two-point angular correlation function of the hard (2-8 keV) X-ray sources. 
The line represents the best-fit power law w(0) = (# /#) 7-1 with 6 Q = 48.9 arcsec and 
7 = 2.2. Insert: Iso-A^ 2 contours in the 7-6*0 parameter space. 
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Fig. 3. — The predicted iV 1 dN/dz distribution for three different LF models: (a) Boyle et 
al. (1998) with PLE (continuous line), (b) Ueda et al (2003) with PLE (dashed line), (c) 
Ueda et al (2003) with LDDE (dot-dashed line). 



